Mining Group Differences

Authors

  • Shane M. Butler
  • Geoffrey I. Webb
Abstract

Finding differences among two or more groups is an important data-mining task. For example, a retailer might want to know how customer purchasing behavior during a sale differs from behavior on a normal trading day. With this information, the retailer may gain insight into the effects of holding a sale and may factor that into future campaigns. Another possibility is to investigate what distinguishes customers who have a loyalty card from those who do not. This could allow the retailer to better understand loyalty cardholders, to increase loyalty revenue, or to make the loyalty program more appealing to noncardholders. This article gives an overview of such group mining techniques. First, we discuss two data-mining methods designed specifically for this purpose: Emerging Patterns and Contrast Sets. We then discuss how these two methods relate and how other methods, such as exploratory rule discovery, can also be applied to this task. Exploratory data-mining techniques, including those used to find group differences, can present the user with a large number of models. Filter mechanisms are therefore a useful way to automatically remove models that are unlikely to be of interest. We examine a number of such filter mechanisms that reduce the number of models with which the user is confronted.
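To make the retailer example concrete, the following is a minimal sketch of two measures commonly associated with the methods named above: the support difference between groups (the quantity contrast-set mining typically thresholds) and the growth rate (the support ratio used to define emerging patterns). The function names, the toy baskets, and the choice of Python are illustrative assumptions, not taken from the article, and the sketch omits the search and filtering steps that the full methods involve.

```python
from typing import FrozenSet, List

Transaction = FrozenSet[str]


def support(itemset: FrozenSet[str], transactions: List[Transaction]) -> float:
    """Fraction of transactions in one group that contain every item of itemset."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def support_difference(itemset, group_a, group_b) -> float:
    """Support in group_b minus support in group_a (contrast-set style measure)."""
    return support(itemset, group_b) - support(itemset, group_a)


def growth_rate(itemset, group_a, group_b) -> float:
    """Ratio of support in group_b to support in group_a (emerging-pattern style)."""
    supp_a = support(itemset, group_a)
    supp_b = support(itemset, group_b)
    if supp_a == 0.0:
        # Absent from group_a: infinite growth if it occurs in group_b at all.
        return float("inf") if supp_b > 0.0 else 0.0
    return supp_b / supp_a


# Hypothetical toy baskets: a normal trading day versus a sale day.
normal_day = [frozenset(t) for t in (["bread", "milk"], ["milk"], ["bread"], ["eggs"])]
sale_day = [
    frozenset(t)
    for t in (["bread", "milk"], ["bread", "milk", "eggs"], ["bread", "milk"], ["milk"])
]

pattern = frozenset({"bread", "milk"})
diff = support_difference(pattern, normal_day, sale_day)
rate = growth_rate(pattern, normal_day, sale_day)
print(f"support difference: {diff:.2f}, growth rate: {rate:.1f}")
```

In this toy data the pattern {bread, milk} appears in one of four normal-day baskets and three of four sale-day baskets, so it would be reported as a group difference under any support-difference threshold below 0.5 or growth-rate threshold below 3.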




Publication date: 2015